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ABSTRACT 

In  this  paper  we  report  on  query  processing  experiments  that 
were  performed  in  one  distributed  data  base  environment.  In 
this  environment  we  compared  the  strategy  produced  by  a  col¬ 
lection  of  algorithms  on  the  basis  of  number  of  bytes  moved. 
Among  other  conclusions  we  found  that  limited  search  algo¬ 
rithms  do  not  perform  very  well  compared  to  algorithms  which 
exhaust  all  possible  processing  plans.  , 
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I  INTRODUCTION 


There  have  been  a  large  number  of  strategies  proposed  for 
processing  high  level  language  data  base  commands.  For  cen¬ 
tralized  data  bases  these  include  [BLAS76,  GRIF79,  STON76, 
WONG76,  YOUS78].  Moreover,  there  have  recently  been 
attempts  to  extend  such  algorithms  to  handle  distributed 
data  bases  [ BERN79  ,  EPST78  ,  GOOD79 ,  HEVN78a ,  HEVN78b , 
WONG77  ]  . 

All  the  above  algorithms  attempt  to  decompose  a  query,  Q, 
into  a  collection  of  subqueries,  Q1,...,Qn  in  such  a  way 
that  the  summation  of  the  costs  of  the  individual  subqueries 
is  minimized.  In  general,  the  collection  of  subqueries  is 
at  least  partially  ordered  in  the  sense  that  Qi  uses  the 
result  of  query  Qj  for  some  j  <  i.  As  a  result  the  cost  of 
query  Qi  depends  on  the  size  of  the  result  produced  by  query 
Qj.  We  now  do  an  example  to  illustrate  this  issue. 

Consider  the  three  relations 

SUPPLIER ( S// ,  SNAME,  SLOCATION) 

PARTS(P# ,  PNAME  ,  PSIZE) 

SUPPLY ( S# ,  Pit,  QUANTITY) 

with  the  obvious  interpretation,  each  on  a  different  site  in 
a  distributed  data  base.  Consider  the  query  to  find  the 
names  of  suppliers  who  supply  bolts,  which  is  expressed  in 
QUEL  [ HELD75 ]  as  follows: 


RANGE  OF  S  IS  SUPPLIER 
RANGE  OF  P  IS  PARTS 
RANGE  OF  Y  IS  SUPPLY 
RETRIEVE  (S.SNAME)  WHERE 
S.S#  =  Y.S//  AND 
Y.P//  =  P.P//  AND 
P.PNAME  =  "bolts" 

Clearly,  we  can  restrict  PARTS  to  a  temporary  relation  TEMPI 

containing  only  the  part  numbers  for  those  parts  named  bolts 

and  pay  no  transmission  cost  at  all.  As  a  result  we  are 

left  with  the  query: 

RANGE  OF  P  IS  TEMPI 
RANGE  OF  S  IS  SUPPLIER 
RANGE  OF  Y  IS  SUPPLY 
RETRIEVE  (S.SNAME)  WHERE 
S.S//  =  Y.S //  AND 
Y.P//  =  P.P// 

To  process  this  resulting  query,  one  possible  tactic  would 
be  to  move  TEMPI  and  SUPPLY  to  the  site  where  SUPPLIER 
exists  and  process  the  above  query  at  that  site. 


Alternately,  we  could  perform  the  query: 


RANGE  OF  P  IS  TEMPI 
RANGE  OF  Y  IS  SUPPLY 
RETRIEVE  INTO  TEMP2(Y.S//)  WHERE 
Y.P  //  =  P.P// 

by  moving  TEMPI  to  the  site  of  SUPPLY.  Subsequently,  we  can 
execute 


RANGE  OF  T2  IS  TEMP2 
RANGE  OF  S  IS  SUPPLIER 
RETRIEVE  (S.SNAME)  WHERE 
S.S »  =  T2.S# 

by  moving  TEMP2  to  the  site  where  SUPPLIER  exists.  This 
second  approach  seems  a  sure  winner,  at  least  in  bytes 
moved,  because  TEMP2  is  guaranteed  to  be  smaller  than 


SUPPLY. 


However,  there  are  other  possible  ways  to  decompose  the 
query.  For  example,  we  could  process  the  query 

RANGE  OF  S  IS  SUPPLIER 
RANGE  OF  Y  IS  SUPPLY 

RETRIEVE  INTO  TEMP3 ( S . SNAME ,  Y.P#)  WHERE 
S.S#  =  Y.S# 

by  moving  either  SUPPLIER  OR  SUPPLY.  Thereafter,  we  could 
process 

RANGE  OF  T  IS  TEMPI 
RANGE  OF  T3  IS  TEMP3 
RETRIEVE  (T3- SNAME)  WHERE 
T3.P#  =  T.P# 

by  moving  either  TEMPI  or  TEMP3.  It  is  unlikely  (but  possi¬ 
ble)  that  this  option  will  be  attractive.  In  general,  the 
choice  of  the  best  strategy  depends  crucially  on  the  sizes 
of  the  intermediate  relations  {TEMPI,  TEMP2 ,  TEMP3 } . 

We  can  now  categorize  query  processing  algorithms  along  the 
following  four  dimensions. 

1)  What  information  is  used  to  estimate  the  sizes  of  inter¬ 
mediate  relations? 

2)  What  technique  is  used  to  perform  the  estimation? 

3)  Are  all  possible  query  processing  plans  evaluated  or  only 
a  subset? 

4)  Is  plan  evaluation  done  at  run  time  or  prior  to  run  time? 


U  - 


We  now  discuss  each  dimension  in  turn. 

1)  Information  available  to  estimate  sizes  of  intermediate 
results 

Query  processing  algorithms  assume  certain  information  is 
available  for  estimating  the  size  of  intermediate  results. 
For  example,  INGRES  assumes  that  only  the  cardinalities  of 
the  relations  in  question  are  known  [WONG76,  STON76]. 
Alternatively,  [GRIF79,  GOOD79]  assume  this  information  and 
also  the  number  of  unique  values  in  each  field.  Lastly, 
[HEVN78a,  HEVN78b]  require  all  this  information  plus  the 
proportion  of  any  relation  that  will  participate  in  a  join 
with  any  other  relation. 

There  is  a  continuum  of  possible  information  from  none  at 
all  to  perfect  information,  i.e.  the  knowledge  of  the  exact 
size  of  any  intermediate  relation  that  might  be  constructed. 

2)  Estimating  technique 

Based  on  some  amount  of  information  each  algorithm  must  then 
estimate  the  size  of  any  intermediate  temporary  relation. 
For  example,  the  algorithm  in  [WONG76]  suggests  using  a 
"worst  case"  estimate.  Hence,  the  result  size  of  any  join 
is  estimated  to  be  the  size  of  the  cross  product  of  the  two 
participating  relations.  In  particular,  if  R1  with  N1 
tuples  is  joined  to  H2  with  N2  tuples,  then  the  size  of  the 
result  is  estimated  to  be  N1*N2  tuples.  Alternately, 


[GRIF79]  suggests  an  estimating  technique  of  "worst 
case/10".  For  example,  the  above  join  would  be  estimated  at 
N 1 *N2/ 10  tuples. 

3)  Limited  versus  exhaustive  search 

Many  algorithms  (e.g.  [WONG78,  GRIF79,  GOOD79])  attempt  to 
limit  the  number  of  choices  examined.  A  popular  tactic  is 
to  evaluate  the  possible  processing  steps  which  can  be  taken 
next  and  choose  the  best  one  without  concern  for  the  ulti¬ 
mate  consequences  of  the  action  taken. 

These  algorithms  never  select  plans  which  result  from  apply¬ 
ing  locally  expensive  tactics.  As  such,  they  will  not  find 
a  global  optimum  solution  unless  the  global  optimum  has  the 
property  that  each  step  is  the  locally  most  advantageous 
thing  to  do. 

Alternately,  one  can  perform  an  exhaustive  search  through 
all  possible  plans  to  find  the  best  one. 

4)  Static  versus  dynamic  decision  making 

The  algorithms  in  [GRIF79]  suggest  that  a  complete  query 
processing  plan  be  constructed  in  advance  (in  fact,  at  com¬ 
pile  time).  Because  the  actual  result  sizes  are  not  avail¬ 
able  until  run  time,  each  decision  is  based  on  estimates  of 
result  sizes.  Consequently,  estimation  errors  will  be  pro¬ 
pagated  to  subsequent  decision  steps.  Alternately,  one  can 
do  dynamic  decision  making.  Here,  a  step  is  not  performed 


until  the  result  of  the  previous  step  is  completed.  Exact 
information  from  the  previous  step  can  be  used  in  selecting 
the  next  step. 

Each  of  these  dimensions  appears  to  be  very  important.  The 
amount  of  assumed  information  is  significant  because  a  data 
base  system  must  keep  reasonably  accurate  estimates  for 
needed  statistics.  Obviously,  the  cost  of  doing  so 
increases  with  the  complexity  and  number  of  such  statistics. 
To  the  extent  that  such  statistics  allow  better  strategies 
they  are  clearly  valuable. 

The  estimating  technique  is  significant  because  it  Is  cru¬ 
cial  for  static  decision  making  where  incorrect  estimates 
propagate  through  the  remainder  of  the  selection  of  a  query 
processing  plan.  Even  for  dynamic  decision  making,  poor 
estimates  may  result  in  an  inferior  strategy  being  adopted. 

Many  algorithms  suggest  limited  search  to  avoid  paying  the 
CPU  overhead  of  examining  all  possible  query  processing  pos¬ 
sibilities.  The  general  feeling  is  that  one  obtains  a 
"good"  strategy  anyway.  Consequently,  the  extra  cost  of 
examining  all  possible  plans  is  not  offset  by  the  savings 
inherent  in  a  significantly  better  strategy. 

Lastly,  generating  a  query  processing  strategy  at  compile 
time  has  the  obvious  benefit  of  saving  run  time  overhead. 
Again  the  question  arises  "is  the  penalty  for  possibly  inac¬ 
curate  estimates  persisting  throughout  the  selection  of  the 
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plan  small  enough  to  justify  the  savings  in  overhead?” 

In  this  paper  we  present  experiments  concerned  with  these 
questions.  Hence,  in  Section  2  we  indicate  the  environment 
chosen  and  the  experiments  performed.  Then  in  Section  3  we 
give  the  results  of  our  experiments  and  interpret  some  of 
the  rather  startling  results.  Lastly,  Section  4  contains 
our  conclusions. 

II  THE  EXPERIMENTAL  TECHNIQUE 

We  are  interested  in  exploring  query  processing  in  a  distri¬ 
buted  environment  and  have  suggested  a  collection  of  algo¬ 
rithms  in  [EPST78].  In  order  to  present  and  interpret  our 
results  we  must  briefly  review  these  algorithms.  The 
environment  assumed  is  a  distributed  data  base  system  where 
each  site  contains  a  subset  of  the  tuples  in  each  relation, 
i.e.  a  fragment  [ROTH77].  The  placement  of  tuples  is  con¬ 
trolled  by  a  distribution  criteria  which  specifies  the  way  a 
relation  is  partitioned  among  the  sites. 

For  example  the  SUPPLY  relation  might  be  distributed  as: 

SUPPLY  where  P#  <  500  AT  site  1 
SUPPLY  where  P#  >=  500  AT  site_2 

Beginning  with  the  query 

RANGE  OF  R 1  is  REL  1 
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RANGE  OF  Rn  is  REL_n 

RETRIEVE  TL ( R 1 , . . .  , R n )  WHERE  0(R1,...,Rn) 
one  first  runs  an  algorithm  "choose  piece".  This  algorithm 
chooses  a  subset  of  the  variables  R1,...,Kn,  say  R1,...,Rk, 
and  then  breaks  the  query  into  two  pieces.  The  first  piece 
is  directly  executed,  while  the  second  is  presented  to  the 
algorithm  to  be  possibly  split  further. 

The  two  pieces  are  respectively: 

1 )  RANGE  OF  R1  is  REL1 

(1  ) 

RANGE  OF  Hk  is  REL_k 

RETRIEVE  INTO  TEMP  TL ' ( R 1 . Rk)  WHERE  Q ' ( R 1 , . . . , Rk ) 

2)  RANGE  OF  T  IS  TEMP 
RANGE  OF  Rk+ 1  IS  RELJC+1 

(2) 

RANGE  OF  Rn  IS  REL_n 

RETRIEVE  TL ' ' ( T , Rk+ 1 . Rn)  WHERE  Q''(T,Rk+1 . Rn) 

Here,  TL '  consists  of  those  fields  which  are  actually  in  the 
ta  _et  list  of  the  query  as  well  as  those  which  appear  in 
Q ' ' .  Moreover,  Q*  consists  of  those  terms  in  Q  which 
involve  only  variables  R1,...,Rk.  TL '  '  contains  those 
fields  desired  as  output  while  Q''  contains  all  clauses  of  Q 
not  in  Q'  and  has  tuple  variables  changed  to  reflect  the 
presence  of  TEMP . 


The  previous  section  contained  two  examples  of  "choose 


piece"  at  work  splitting  a  query  on  (SUPPLIER,  SUPPLY, 
TEMPI).  The  first  split  was  into  the  pair  {(TEMPI,  SUPPLY) 
and  (TEMP2,  SUPPLIER)}  while  the  second  was  into  {(SUPPLIER, 
SUPPLY)  and  (TEMP3 ,  TEMPI)}. 

The  first  piece  of  the  query  is  executed  by  choosing  one  of 
the  relations,  Rp  and  the  number  of  processing  sites,  L. 
Then,  all  tuples  in  relations  designated  by  R1,...,Rk  except 
Rp  are  moved  in  such  a  way  that  they  are  present  at  all  L 
processing  sites.  The  relation  Rp  is  left  fragmented  among 
the  L  sites.  Consequently,  each  of  the  L  sites  can  run  the 
query  (1)  above  to  produce  a  TEMP  at  its  site.  The  compo¬ 
site  of  all  L  TEMP  relations  is  the  desired  result. 

To  illustrate  the  above  processing,  suppose  SUPPLY  is  split 
between  sites  1  and  2  as  noted  above  while  PARTS  is  at  site 
3  and  SUPPLIER  at  site  4 .  The  example  query  is  to  find  the 
names  of  bolt  suppliers  as  in  Section  1.  As  noted  earlier 
we  can  restrict  PARTS  to  the  relation  TEMPI  at  site  3  and 
pay  no  data  movement.  Thereafter,  we  might  choose  to  pro¬ 
cess  the  portion  of  the  query  involving  TEMPI  and  SUPPLY, 
i .  e . 


RANGE  OF  P  IS  TEMPI 
RANGE  OF  Y  IS  SUPPLY 
RETRIEVE  INTO  TEMP2(Y.S//)  WHERE 
Y.P#  =  P.P// 

If  we  choose  L  =  2  and  Rp  =  SUPPLY,  then  we  must  move  TEMPI 
from  site  3  to  sites  1  and  2.  Then,  we  can  run  the  above 
query  at  these  sites  each  of  which  will  produce  a  fragment 
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of  the  result,  TEMP2. 


A  complete  set  of  tactics  for  "choose  piece",  Rp  and  L  is 
given  in  [ EPST80 ] . 

The  above  algorithm  is  oriented  toward  the  possibility  that 
L>1.  One  environment  in  which  a  distributed  data  base  sys¬ 
tem  will  run  is  that  of  several  machines  connected  over  a 
local  network  which  allows  efficient  broadcasting  [ROWE793. 
In  such  an  environment  one  usually  wishes  to  choose  L  >  1 
[EPST80].  Moreover,  the  algorithm  makes  no  use  of  "semi¬ 
join"  tactics  [BERN79].  Such  tactics  may  be  very  attractive 
in  some  environments. 

To  study  this  algorithm  under  several  alternate  scenarios, 
we  coded  a  simulation  program  to  calculate  the  cost  of  vari¬ 
ous  possible  plans.  This  program  used  the  following  assump¬ 
tions  : 

1 )  The  cost  function  to  be  minimized  is  the  number  of  bytes 
of  data  moved.  Although  it  is  argued  in  [EPST78]  that  this 
is  not  always  the  sole  parameter  of  interest,  it  is  an 
important  one  which  is  easily  measured. 

2)  T  istributed  environment  is  a  collection  of  nodes  con¬ 
nected  by  a  "contention  net"  network  [METC76,  ROWE793. 
Hence,  the  cost  to  send  data  to  all  sites  is  equal  to  the 
cost  to  send  it  to  any  site.  Such  an  assumption  closely 
models  the  ETHERNET,  COCANET  or  other  local  network.  The 


purpose  of  this  assumption  is  to  1 
ters  in  the  study.  It  is  shown  in 
very  unusual  circumstances:  (1)  L 
data  moved  is  independent  of  the 
fragments  and  independent  of  the 
(L  above).  Hence,  these  parameter 
s imulation . 


imit  the  number  of  parame- 
[EPST80]  that  except  in 
>  1  and  (2)  the  amount  of 
initial  distribution  of 
number  of  processing  sites 
s  will  not  appear  in  the 


3)  Only 
[ ST0N7 6  , 
lowed  by 
required 
Hence,  it 


retrieval  commands 
WONG76]  all  updates 
lower  level  proces 
in  a  distributed  env 
is  appropriate  to  i 


are  considered.  In 
are  turned  into  retriev 
sing.  The  slight  ex 
ironment  are  given  in  [ 
nvestigate  retrievals. 


INGRES 
als  fol- 
tensions 
EPST80]  . 


*4 )  The  commands  to  be  processed  may  involve  multiple  rela¬ 
tions.  If  so,  the  relations  are  joined  by  an  equi-join  of 
the  form 


relname_1 . domain_name  =  r elname_2 . domain_name_ 2 

Moreover,  it  is  assumed  that  the  two  domain  names  are  4 
bytes  wide  and  do  not  appear  anywhere  else  in  the  command. 


The  purpose  of  this  assumption  is  to  simplify  the  bookkeep¬ 
ing  concerning  how  wide  partial  answers  are. 

5)  The  cardinality  of  each  relation  involved  in  a  command  is 
varied  over  the  set  of  values  {10  tuples,  100  tuples,  1000 
tuples}.  Moreover,  each  relation  can  either  have  fields 
present  in  the  target  list  or  not.  If  a  relation  has  fields 


2 


in  the  target  list,  it  is  assumed  to  contribute  10  bytes  per 
tuple  to  the  answer  to  the  query.  Notice  that  the  results 
which  we  will  document  scale  in  the  obvious  way  to  other 
relation  sizes.  Hence,  the  important  point  is  the  100  to  1 
range  that  is  examined. 

6)  The  actual  size  of  any  possible  temporary  relation  that 
might  be  constructed  is  available.  Algorithms  use  this 
information  in  various  ways. 

As  a  result  of  the  above  assumptions,  the  simulation  program 
accepts  as  input  the  following: 

1)  the  number  of  relations  involved 

2)  a  graph  of  the  way  the  joining  terms  interconnect  the 
relations 

3)  the  true  size  of  any  possible  join  which  might  be  per- 

f  formed 

The  program  then  is  run  for  6**n  different  combinations  of 
relation  sizes  and  presence  or  absence  of  a  contribution  to 
the  target  list,  where  n  is  the  number  of  relations 
involved. 

This  results  from: 

3  relation  sizes  per  relation 

2  widths  of  contribution  to  the  target  list  per  relation 
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In  our  simulation  we  concentrated  on  queries  spanning  4 
relations;  hence  there  are  6**4  or  1296  cases  to  consider. 
In  fact,  81  of  these  cases  have  a  null  target  list  and  are 
uninteresting.  Consequently,  we  are  left  with  1215  to  pro¬ 
cess. 

Our  strategy  was  to  choose  benchmark  queries  along  with  data 
for  the  sizes  of  all  possible  joins  and  then  to  run  through 
all  1215  test  cases.  For  each  case  we  computed  the  cost,  in 
bytes  moved,  of  the  query  processing  plan  produced  by  14 
different  algorithms.  They  correspond  to  every  reasonable 
combination  of  the  following  options. 

-information  available- 

11)  perfect  information 

Here  we  assume  that  the  algorithm  has  access  to  the  actual 
size  of  any  temporary  which  it  might  wish  to  create  and  can 
use  this  information  in  its  cost  calculations. 

12)  size  of  relation  plus  a  uniqueness  indicator 

Here  we  assume  that  the  algorithm  must  estimate  the  size  of 
any  temporary  result  and  has  at  its  disposal  the  size  of  the 
participating  relations  plus  one  bit  per  field  indicating 
whether  each  row  contains  a  unique  value  or  not  (or  some¬ 
thing  close).  Should  an  algorithm  wish  to  estimate  the  max¬ 
imum  size  of  any  join,  it  can  do  so  as  shown  in  Table  1. 
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Operation 


Estimated  Result  Size 


R1  join  R2  (both  joining  N1*N2  tuples 

fields  non  unique) 

R1  join  R2  (joining  field  N1  tuples 

in  R1  unique) 


R1  join  R2  (joining  field  N2  tuples 

in  R2  unique) 

R1  join  R2  (both  joining  min(N1,  N2)  tuples 

fields  unique) 


Maximum  Size  Estimates 
Table  1 


-estimating  technique- 

el)  Any  temporary  result  is  estimated  to  be  the  maximum  size 
from  Table  1 . 

e2)  Any  temporary  result  is  estimated  to  be  the  maximum  size 
divided  by  2. 

e3)  Any  temporary  result  is  estimated  to  be  the  maximum  size 
divided  by  10. 

-limited  versus  exhaustive  search- 

11)  Exhaustive  search  is  performed..  In  particular,  we 
iterate  over  all  possible  ways  that  a  piece  can  be  chosen  by 
"choose  piece".  For  each  such  piece  we  then  iterate  over 
all  possible  ways  the  second  piece  can  be  processed.  Conse¬ 
quently,  exhaustive  search  evaluates  all  possible  splits  and 
then  chooses  the  plan  with  lowest  estimated  total  cost.  The 


r> 


best  choice  for  Rp  is  always  the  relation  with  largest  car¬ 
dinality.  Hence,  this  is  the  one  selected. 


The  number  of  plans  which  exhaustive  search  examines  is: 

e  (  1  )  =  1 
e(2)  =  2 

r  i 

n  •  n ; 

e(n)  =  2  ‘  •'  *  e ( n-i+  1 ) 

_  i  I  i 

l  =  2  >-  -> 

where  "n"  is  the  number  of  variables  in  the  query.  For  each 
plan,  the  number  of  variables  in  the  first  piece,  i,  is 
chosen  and  then  the  remaining  query  requires  exhaustively 
examining  all  combinations  of  the  remaining  n  -  i  +  1  vari¬ 
ables.  The  number  of  actual  choices  is  limited  by  the 
user's  query,  i.e.  by  the  interconnections  of  the  variables 
in  the  graph.  All  possible  combination  must  be  tried.  For 
example,  the  number  of  plans  evaluated  for  a  four  variable 
query  is  29  and  for  a  five  variable  query  336.  Notice  that 
many  of  the  choices  may  not  have  a  clause  in  piece  1  of  the 
split  query.  As  such  they  are  degenerate  and  need  not  be 
considered.  Hence,  the  real  number  of  possibilities  exam¬ 
ined  is  much  less  than  the  maximum  amount  possible. 

12)  Limited  search  is  performed.  For  any  query  M,  limited 
search  estimates  the  cost  of  any  split  into  pieces  Ml  and  M2 
as : 

cost(M)  a  cost(MI)  +  cost(M2) 

The  cost  of  Ml  is  computed  as  the  number  of  bytes  moved. 


Moreover,  the  cost  of  M2  is  computed  AS  IF  M2  IS  NOT  SPLIT 
FURTHER.  The  lowest  cost  split  is  the  one  selected. 

Exhaustive  search  would  look  through  all  possible  ways  to 
split  M2  in  order  to  find  a  minimum  cost  first  split, 
whereas  limited  search  does  not  perform  this  "look  ahead". 
Therefore,  the  number  of  plans  which  limited  search  examines 
is: 

r  i 

n  ;  n ' 

e(n )  =  2  •  .1 

i  =  2  •-  ^ 

For  example,  it  only  needs  to  examine  at  most  11  cases  for  a 
four  variable  query  and  26  for  a  five  variable  query. 

-static  versus  dynamic  decision  making- 

si)  Static  decision  making  is  performed.  Here,  a  complete 
access  plan  is  found  prior  to  run  time.  Exhaustive  search 
is  run  once  to  find  the  best  expected  plan.  Limited  search, 
on  the  other  hand,  finds  a  choice  for  Ml  and  the  expected 
size  of  the  resulting  TEMP.  Then,  it  is  run  on  M2  and  must 
select  a  piece  to  process  using  only  its  initial  information 
and  the  expected  size  of  TEMP.  The  algorithm  is  run  until  a 
piece  is  left  which  cannot  be  split. 

s2)  Dynamic  decision  making  is  performed.  Either  exhaustive 
search  or  limited  search  is  run  to  find  a  first  piece  to 
process.  Then,  the  query  Ml  is  run  and  the  ACTUAL  SIZE  of 
the  TEMP  produced  is  available.  Either  algorithm  is  then 
rerun  on  the  query  M2  with  this  extra  information. 


17 


The  actual  algorithms  which  we  tested  are  the  following: 

1)  (perfect  information,  exhaustive) 

This  algorithm  examines  all  possible  ways  to  process  the 
query  and  has  perfect  information.  Hence,  it  must  neces¬ 
sarily  find  the  optimum  plan.  Of  course,  this  information 
would  never  be  available  in  practice;  rather  this  answer 
represents  the  best  which  could  be  done  and  is  reminiscent 
of  the  OPT  algorithm  for  buffer  management  [MATT70]. 

2)  (limited  information,  maximum  size,  exhaustive,  static) 

The  number  of  bytes  moved  by  an  algorithm  which  enumerates 
all  possible  plans  is  found.  However,  this  algorithm  ESTI¬ 
MATES  the  size  of  any  TEMP  relation  from  Table  1.  Moreover, 
this  algorithm  finds  a  complete  access  plan  in  advance  of 
running  any  pieces  of  the  query  so  it  can  obtain  no  feedback 
on  the  actual  size  of  any  relation  that  it  created. 

3)  (limited  information,  maximum  size/2,  exhaustive,  static) 

This  algorithm  is  the  same  as  algorithm  2)  except  that  it 
estimates  the  size  of  any  TEMP  relation  as  the  size  from 
Table  1  divided  by  2. 

M)  (limited  information,  maximum  size/10,  exhaustive, 
static) 

This  algorithm  is  the  same  as  3)  above  except  it  uses  a  fac- 


:1 


tor  of  10  instead  of  a  factor  of  2. 

5)  (limited  information,  maximum  size,  exhaustive,  dynamic) 

This  algorithm  behaves  like  2)  in  many  ways.  It  enumerates 
all  possible  decisions  to  find  the  best  processing  strategy. 
Moreover,  it  uses  the  same  estimating  procedure  as  that  used 
by  algorithm  2).  Then,  it  performs  the  first  step  of  the 
plan  and  obtains  the  ACTUAL  size  of  the  TEMP  relation  which 
it  created.  Subsequently,  it  uses  this  additional  informa¬ 
tion  to  create  a  new  plan  for  processing  piece  2. 

It  is  true  that  algorithms  2)  and  5)  will  choose  the  same 
initial  piece  to  process.  However,  5)  obtains  the  true  size 
of  TEMP  while  2)  must  use  its  estimate.  Hence,  subsequent 
steps  may  diverge  between  the  two  cases. 

6)  (limited  information,  maximum  size/2,  exhaustive, 

dynamic) 

This  is  the  same  as  5)  except  estimates  are  set  to  be  the 
maximum  size  divided  by  2. 

7)  (limited  information,  maximum  size/10,  exhaustive, 

dynamic ) 

This  is  the  same  as  6)  except  a  factor  of  10  is  used. 

8)  —14)  (limited  search) 

Algorithms  1)  to  7)  were  repeated  for  a  search  strategy 
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which  used  the  limited  search  tactic  mentioned  above. 


In  the  next  section  we  present  a  sampling  of  our  results. 
Ill  EXPERIMENTAL  RESULTS 


Limited  and  exhaustive  search  will  differ  in  the  number  of 
cases  considered  only  if  the  number  of  relations  involved  in 
a  query  is  four  or  more.  Hence,  we  consider  the  following 
representative  four  relation  command: 

RETRIEVE  (TL) 

WHERE  R1 .e  =  R2.  f  and 
R2.g  =  R3-h  and 
R3  •  i  =  R4.j 

(3) 


Notice  that  (3)  uses  pairwise  natural  joins  as  the  qualifi¬ 
cation.  In  the  target  list,  a  10  byte  field  from  each  rela¬ 
tion  can  optionally  be  present.  As  noted  earlier,  the  pres¬ 
ence  of  such  a  field  is  a  simulation  variable. 


Table  2  indicates  the  two  sets  of  test  data  which  we  use. 


A  test  data  B  test  data 


R 1 

join 

R2 

10 

100 

R2 

join 

R3 

20 

200 

R3 

join 

R4 

5 

2 

-  20  - 


Number  of  Tuples  in  the  Various  Joins 
Table  2 


For  example,  using  the  A  test  data,  if  one  chooses  to  pro¬ 
cess: 

RETRIEVE  INTO  TEMP ( R3 . e )  WHERE  R3.f  =  R4.j 

then  TEMP  will  be  a  relation  containing  5  tuples. 

These  join  sizes  have  been  assumed  constant  regardless  of 
the  sizes  of  R1,...,R4.  In  this  way  we  are  implicitly  vary¬ 
ing  the  percentage  of  the  cross  product  which  remains  in  the 
join  between  100  percent  and  .0002  percent. 

For  both  test  situations  we  assume  that  the  joining  fields 
are  unique  values.  Consequently,  maximum  size  estimates 
were  obtained  from  r  w  4  of  Table  1 . 

We  now  present  some  of  our  results  for  the  A  test  data. 
First,  we  treat  the  issue  of  limited  versus  exhaustive 
search  with  perfect  information  by  comparing  algorithms  1 
and  8.  Both  strategies  will  find  a  query  processing  plan 

for  each  of  the  1215  cases  examined.  In  Figure  1  we  present 

100  of  the  1215  points  sorted  in  descending  order  of  the 

difference  in  magnitude  f  bytes  moved  between  the  two  algo¬ 
rithms.  The  first  20  cases  show  an  order  of  magnitude 
difference  between  limited  and  exhaustive  search! 

Figure  2  summarizes  limited  versus  exhaustive  search  in  a 


different  way.  Here,  we  plot  the  percentage  of  cases  in 
which  limited  search  performs  within  a  given  factor  of 
exhaustive  search.  Note  that  59  percent  of  the  time  the  two 
strategies  yield  the  same  algorithm  and  achieve  the  same 
cost  (relative  performance  of  limited  search  is  0  percent 
worse).  However,  in  only  75  percent  of  the  cases  does  lim¬ 
ited  search  come  within  a  factor  of  two  (relative  perfor¬ 
mance  100  percent  worse)  of  exhaustive  search.  Lastly,  in 
only  90  percent  of  the  cases  can  it  come  within  a  factor  of 
three  (relative  performance  200  percent  worse). 

In  order  to  intuitively  understand  what  is  happening,  we 
examine  in  detail  the  worst  of  the  1215  cases.  This  case 
corresponds  to  relations  R1,  R3  and  R4  having  cardinalities 
of  1000  and  target  list  sizes  of  10;  and  R2  having  a  cardi¬ 
nality  of  10  and  no  target  list.  In  that  case,  the  cost  to 
perform  the  query  using  exhaustive  search  was  380  bytes. 
The  cost  for  limited  search  was  1^,200  bytes.  Exhaustive 
search  broke  the  query  into  three,  two  variable  pieces. 
First,  R2  was  joined  to  R 3 ;  then  the  result  was  joined  to  R4 
and  finally  the  second  result  was  joined  to  R1  producing  the 
answer.  On  the  other  hand,  limited  search  broke  the  query 
into  two  pieces,  R1  joined  with  R2  followed  by  the  remainder 
of  the  query  as  one  piece. 

In  more  detail  exhaustive  search  creates  the  following  algo¬ 


rithm. 


step  procedure  cost  in  bytes  moved 

join  R2  and  R3  move  R2  80 

join  result  and  R4  move  result  180 

join  result  and  R1  move  result  120 

380 


In  order  for  limited  search  to  find  the  optimal  strategy,  it 
would  have  to  find  that  R2  joined  to  R3  was  the  first  piece 
to  process.  The  cost  for  that  split  would  be 


step 

join  R2  and  R3 

join  result  to 
R 1  and  R4 


procedure 

move  R2 

move  result 
and  R  1 


cost 

80 

180  +  14000 
14260 


However,  the  strategy  it  chose  was 


step 

join  R3  and  R4 

join  result 
to  R1  and  R2 


procedure  cost 
move  R4  14000 
move  result 

and  R2  80  +  120 
14200 


Thus  the  minimum  cost  for  a  2-3  split  was  to  start  with  the 
( R3 ,  R4)  piece.  Notice  that  by  doing  so,  limited  search 
does  an  expensive  move  for  the  first  piece  but  leaves  itself 
with  two  inexpensive  moves  for  the  second  piece.  Because 
limited  search  does  not  break  down  all  possible  ways  to  pro¬ 
cess  the  second  piece  until  AFTER  it  chooses  the  first 
piece,  it  makes  an  expensive  error.  The  heart  of  the  prob¬ 
lem  is  that  it  has  an  inaccurate  estimate  for  the  cost  of 
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processing  the  second  piece. 


Figure  3  contains  the  same  information  as  Figure  1  for  algo¬ 
rithms  1,  5  and  6.  Here,  we  are  comparing  the  estimation 
technique.  Algorithm  1  requires  perfect  information  to 
operate.  However,  algorithm  5  makes  maximum  size  estimates 
while  6)  uses  maximum  size/2.  Notice  that  1)  dramatically 
outperforms  5)  but  is  only  marginally  better  than  6). 
Presumably,  this  is  because  maximum  size/2  is  a  much  more 
accurate  estimate  of  the  true  result  size  than  maximum  size. 
On  the  basis  of  this  more  accurate  information  it  should  be 
able  to  perform  better. 

We  now  turn  to  a  comparison  of  static  and  dynamic  decision 
making.  Figure  4  compares  the  performance  of  algorithms  5 
and  12.  Notice  that  there  are  cases  when  dynamic  decision 
making  does  signi f icantl y  better  than  static.  The  overall 
difference,  however,  is  not  very  large. 

To  assign  an  overall  rating  to  each  of  the  14  algorithms 
mentioned  previously,  a  single  number  was  derived  for  each 
one.  This  was  done  by  averaging  the  cost  for  all  1215  cases 
for  query  (3).  This  represents  a  very  crude  performance 
measure  since  it  assumes  that  each  situation  has  an  equal 
probability  of  occuring.  Table  3  shows  the  average  perfor¬ 
mance  for  A  and  B  test  da 
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A  test  data 


exhaustive 

limited 

1 

perfect 

information 

508 

1086 

2 

maximum 

size,  static 

860 

1070 

3 

maximum 

size/2,  static 

545 

1257 

4 

maximum 

size/10,  static 

536 

1058 

5 

maximum 

size,  dynamic 

755 

927 

6 

maximum 

size/2,  dynamic 

537 

1252 

7 

maximum 

size/10  ,  dynamic 

531 

1053 

B  test 

data 

exhaustive 

limited 

1 

perfect 

information 

551 

1036 

2 

maximum 

size,  static 

1017 

1210 

3 

maximum 

size/2,  static 

781 

1420 

4 

maximum 

size/10,  static 

774 

1229 

5 

maximum 

size,  dynamic 

995 

1  128 

6 

maximum 

size/2,  dynamic 

746 

1403 

7 

maximum 

size/10,  dynamic 

740 

1207 

Average  Performance  in  Bytes  Moved 
Table  3 


The  results  indicate  that  exhaustive  search  performs 
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consistently  better  than  limited  search  for  both  sets  of 
test  data.  Moreover,  the  performance  of  exhaustive  search 
is  very  sensitive  to  the  estimation  procedure  used.  A  max¬ 
imum  size  estimate  was  significantly  worse  than  either  max¬ 
imum  size/2  or  maximum  size/10.  This  indicates  that  maximum 
size  estimation  is  simply  too  pessimistic  to  use.  On  the 
other  hand,  note  that  maximum  size  is  a  viable  tactic  when 
limited  search  is  employed. 

Dynamic  decision  making  tends  to  do  somewhat  better  than 
static  decision  making  but  the  difference  is  around  10  per¬ 
cent.  Hence,  dynamic  decision  making  is  only  cost  effective 
if  it  is  very  inexpensive. 

We  ran  several  alternate  queries  with  similar  results.  In 
Table  4  we  show  values  for  one  of  them: 

RETRIEVE  (Rl.a,  R2.b,  R3.c,  R4.d) 

WHERE  Rl.e  =  R2.f  and 
R2.g  =  R3.h  and 
R3-i  =  R4.j  and 
R4 . 1  =  Rl  .m 

(4) 

Here,  a  joining  term  linking  R4  to  Rl  is  added  to  the  qual¬ 
ification  of  query  (3)  and  we  use  the  A  test  data  augmented 
by: 

R4  join  Rl  =  20  tuples 


The  last  experiment  which  we  report  uses  query  (3)  and  the  B 
test  data.  However,  we  assume  that  the  join  fields  are  not 
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A  test  data 


exhaustive 

limited 

1 

perfect 

information 

631 

1340 

2 

maximum 

size,  static 

1259 

1289 

3 

maximum 

size/2,  static 

724 

2128 

4 

maximum 

size/10,  static 

724 

1475 

5 

maximum 

size,  dynamic 

1185 

1183 

6 

maximum 

size/2,  dynamic 

721 

2128 

7 

maximum 

size/ 10 ,  dynamic 

721 

1471 

Average  Performance  in  Bytes  Moved 
Table  4 


unique.  Hence,  from  Table  1  maximum  size  estimates  will  be 
the  full  cross  product  and  very  pessimistic.  In  addition, 
we  include  maximum  size/1000  as  a  tactic  in  place  of  maximum 
size . 

The  results  are  shown  in  Table  5.  Notice  that  maximum 
size/1000  is  the  only  estimation  technique  which  produces  a 
reasonable  plan. 

IV  CONCLUSIONS 

The  following  conclusions  hold  in  the  environment  which  we 
have  simulated: 
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B  test  data 


exhaustive 

limited 

1 

perfect 

information 

508 

1086 

2 

maximum 

size/2,  -atic 

3300 

3549 

3 

maximum 

size/10,  static 

3206 

3319 

4 

maximum 

size/1000,  static 

876 

1290 

5 

maximum 

size/2,  dynamic 

3372 

354  1 

6 

maximum 

size/10,  dynamic 

3307 

3179 

7 

maximum 

size/1000,  dynamic 

852 

1240 

Average  Performance  in  Bytes  Moved 
Table  5 


1)  limited  search  performs  poorly 

There  is  a  dramatic  difference  between  the  quality  of  the 
plans  produced  by  limited  and  exhaustive  search.  For  four 
and  five  relation  queries  the  cost  of  exhaustive  search  in 
CPU  time  is  sure  to  be  well  worth  it. 

2)  good  result  size  estimates  are  crucial 

Maximum  size  estimates  are  dramatically  inferior  to  maximum 
size/2  or  maximum  size/10,  at  least  for  exhaustive  search. 
Maximum  size/10  (as  assumed  by  CGRIF793 )  appears  to  be  a 
generally  good  tactic,  although  there  are  situations  where 


it  is  too 


pessimistic. 


3) 


dynamic  decision  making 


is  beneficial 


Dynamic  decision  making  does 
decision  making;  however, 
has  greater  run-time  cost,  it 


consistently  better  than 
Because  dynamic  decision 
may  not  a  big  winner. 


static 

making 


15000 


CASE  NUMBER 


COST 
(in  bytes) 
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